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I. INTRODUCTORY 

1. Since in other brandies of science symbols Ijearing an objecti\'e 
or logical significance have been usefully employed, conjoined with 
symbols of number, to express quantity, it may be expected that in the 
a^,^,^.,.^ rvfu+oticitjoi, wViir-Vi nf its; Assfince is the enumeration of logical 



(4^1 + 4-^2+ ■■• +'8-^.; \oj 

Since each way of sampling is equally likely to occur, dividing by 
I", the number of ways, we obtain 

(p,Ai+p,A, + ...+p.A,y, (4) 

the 'frequency array' of samples of n. 
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If like terms are collected, the expansion, by the multinomial 
theorem, is 

^Pn„n,...n,A,''^A,"^...As\ (5) 

where i>»i,«,...n,= "" ^ p{Hp^^-i...p,^>, (6) 

IV\ ', It^ . • ' • It's • 

«j, Wj ... having all values from to m subject to 

«1 + W2 + . . . + «s = M. 

Here the coefficient pni,n2...ng is the frequency, or probability of 
occurrence, of the draw characterised as containing ih individuals of 
the kind Ai, n^ individuals of the kind Ao_, and so on, and sym- 
bolised by 

^j"lJ.2"2...^,"» (7) 

It will be seen that the cmistitution of the sample, as shown in (7), 
differs in its mode of expression from the constitution of the population, 
as shown in (1), in that the individual constituents are associated in 
product, instead of in sum, the number of each kind of constituent 
appearing as an index instead of as a coefficient. 

Now it is clear that this way of expressing constitution, that is to 
say by symbols associated in product, which is presented as the natural 
consequence of invoking the distributive law of algebra for exhibiting 
and enumerating the possible samples of n drawn from a population, 
may, by convention, be extended to indicate tlie constitution of a unit 
in the several characters it possesses, whether these characters are re- 
garded as the result of a sampling process or as given in any other 
manner. 

Any population, then, formed of an aggregate of units of diverse 
constitutions, can be spoken of as shown in ' sample ' array when ex- 
pressed in a form such as (5), that is to say, in the form 

%PaJ,o...A''B''C' ..., (8) 

to indicate that pahc... is the frequency of occurrence of an individual 
possessing the character A in the degree a, the character B in the 
degree h, and so on. 

In the extended definition given to the symbols in (8) it is not 
necessary that the indices have the integral values proper to (5). 
According to the natures of the several characters A,B, C ... and their 
mode of measurement, the indices may take fractional and negative 
values and, if any of the characters are continuous variates, signs of 
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integration will replace signs of summation. Thus if all are continuous 
vaviates the frequency array (8) will assume the form 

■ ..f{a,b,c...)A''B''("... dudbdc..., (Ii) 

with the meaning thui f {a, b, c ...) dadbdc ... individuals are enu- 
merated as possessing the combination of measurements a in the 
character ^4, b in the character B, and so on. 

3. It may be observed at this point that the introduction of the 

symbols into the expressions describing statistical counts supplies a 

feature which enables the whole distribution to be exhibited in a single 

expression without ambiguity or reserve. Thus the binomial and 

Gaussian frequencies, expressed as 

U={iUp.AY, (10) 

-l^'l^'dx 

■' (11) 



-\: 



are complete embodiments and may enter as they stand into mathe- 
matical analysis. Without the symbols 

{p'+pf and == 

./-GO ..^ l7r<T 

stand only for unity until the reservation is made that the conjunctive 
signs between the expanded terms or differentials are to be ignored. 

4. As a simple example of the advantage that may accrue from 
assembling all the eventualities in a single expression, let us apply 
arrays to the well-known problem of 'points.' A and B play a game 
for a point, in which .4 's chance of success is p and of failure p =\ -p. 
The game is repeated as often as A wins and the contest closes when 
A loses. What are A'>i chances of winning 0, 1, 2 ... points? 

The symbol A indicating a win to the player A, \etf{A) be the 
required frequency array of total points won by .4 in a contest, sup- 
posing the contests continued without limit. The chances of the first 

game are shown as , , 

^ p +pA, 

indicating that p' is the chance of an initial loss and p the chance of 
an initial win. 

In the latter event the chances of subsequent wins are arrayed as 



.■.f{A)=p'-^pA/(A). 

•■•^^^^=(f-^0'' (^^> 
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If the conditions are varied, the contest closing after n losses to A, 
we are sampling the array (12) n times in succession, n such samples 
bringing A' a losses up to n. Thus ^'s expectations of winning 0, 1, 2 ... 
points in a contest under these conditions are arrayed 

(^-P-,a)-\ (13) 

the negative binomial series of frequencies. 



II. ELEMENTARY PROPOSITIONS. MOMENT ARRAYS 

1. Attention will now be directed to a few elementary propositions 
applicable to frequency arrays as above defined, which may be described 
as complete statistical enumerations, arrayed as a series of terms, each 
term comprising a numerator, or count, multiplying a denominator or 
arrangement of symbols in product formation to indicate the characters 
borne and their amounts. 

2. Clearly, if 

f{A) = -%prA'; or f{A) = jp{a:)A-da:, 

are arrays in A, multiplication by A™, or 4"™, respectively augment, 
or reduce, all the indices in the array by m and so throw back, or 
advance, the origin of measurement of A m units. For instance, if m is 
the mean of the array /(J.), 

A-^f{A) 

will represent the array referred to its mean as origin of measurement. 

(li) 

Again the substitutions A = J-/, or A^ = A^, respectively diminish, 
or increase, the unit of measurement s times. For instance, if o- is the 
standard deviation of the array /(J.), by putting A'' = Ai, we get 

as the representation of the array with unit of measurement so changed 
as to make the s.d. equal to unity. (15) 

If U=f{A, B, G ...) array the frequencies with which varying 
quantities oi A, B, C . . . are found in the units of a population, f/" will 
array the frequencies with which total A , total B ... will be found in 
samples of w drawn at random from U (with replacement if the popu- 
lation is not a large multiple of n). (16) 

In like manner U'' F™ IF" . . . will array the totals of the characters 
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A , B, C ... found in samples of ^ + w + « + . . . drawn, / from a population 
U, m from a population V etc. possessing these characters (17) 

If samples of varying number are drawn, with replacement, from 
U, the number r being drawn with frequency p,-, then 'number in 
sample ' is a character which, if its symbol be <S', has the array 

<t>[S)=p,S+p.,S'+p,S'+ (18) 

By (16), if C/is put for .s' in the right-hand side of (18), we shall 
have, arrayed, the frequencies of total A, total B etc. in the variously 
sized samples. 

Hence if U is sampled in variable numbers, the number in sample 
having the frequency array <i> {S), the totals of the characters to be 
found in the samples will have the frequency array 

•i>{U) (19) 

In any frequency array the suppression of symbols gives the fre- 
quencies of the larger classes. For instance, if/(^, B, C) arrays the 
frequencies of A, B, C, putting B= 1, C= 1 gives/(^4, 1, 1) the total fre- 
quencies of ^4 . If all the S3rmbols are suppressed in a frequency array the 
result is, of course, the total frequency, generally taken as unity. . . . (20) 

3. Moment arrays. Of special utility are the symbols we have con- 
sidered in the development of the moment coefficients of frequency 
distributions * 

* Smooth mathematical frequencies are often fitted to the rough frequencies of 
samples by equating the moments. Perhaps the justification rests in the following 
proposition. 

If the frequencies to be fitted have coetBoient of the form 

/=/o + Sr,,.. .'>,....*'■ !/'•■• (i) 

(including the general Gaussian), such fitting by moments will render the probability 
of the sample a maximum for the given classification, dxdy...'\, that is, will produce 
the 'best' fit. 

For if n repeats, in product, values for the n units of the sample, and m^^ ,... stands 
for the r, 8... moment coefiicient of the sample, the probability of the sample is, by (6), 

n\-ae'<' + ^'^r.,...^''y'- {dxdxj...Y = n\e^'^o + nSc^.,...'<tir.s... (dxdy. ..)'', 
and is a maximum when Cq+ Sc^^g,,, ^r.s.-- ^^ ^ maximum. Hence variations 5cq... in 
the constants must conform with 

5c„ + .S'mr,,... S''r,,... = 0. 

But since lfdxdy... = l the variations must conform with 
dc„ + Sm',^ .... Sc,.. .... = 0, 

where mV t ^^ *^^ ^' ** ■- uaoment of the assumed universal (i). 

Hpncem',, ,... = »!,, ,... or, for best fit of (i) to a series of observations, the moments, 
corresponding to the terms of the polynomial exponent, must be equal in universal 
and sample. 

t There are no 'probabilities' until the classification is specified. 
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In the first place, considering a single variate, if 

/(^) = 2jt>,4»' (21) 

X 

arrays the frequencies, j»a;, with which the measures, x, of the character, 
A, occur in a population, the substitution in (21) of 

A=6'' (22) 

gives /(e°) = 2j33,e^''=2ji>a:(l + !>sa + ai^'^^+ ...\ 

a" a' ,„„, 

= l + mia + ?K2 — + ... +W2^— + ..., (23) 

which may be termed the 'moment array' of the character A, the rth 
moment, nir, of the distribution, about the origin, being the coefficient 
of a^jr ! in the array. 

The same is true if ^ is a continuous variate arrayed as 



f[A)=L{x)A''dx. 



In the second place, if there are several characters A, B, C ■■., the 
substitutions, A=e^, B = eF ..-, render 

A''BiC''...=e"'e'-'^e''' ... 



(l+a-a + ^J|+...)(^l + 2//3 + y^^+...) 



and render the frequency array an array of moments whose symbols 
betoken the kind of moment and whose coefficients record the value of 
the moment. 

The ' moment array,' then, is the result of substituting 

A = e% B=e^, C=e^ ... 
in the ' frequency array ' and has this property that, when expanded in 
integral powers of the new symbols <x, ;8, y ... , it gives the r, s,t ... 
moment of the characters ^, B,C ..., about the origin, as the coefficient of 

-^/^V- (24) 

r\s\t\... ^ ' 

in the expansion. 

It will be convenient, in the sequel, to express arrays indifferently 
in terms of the capital letters, signifying character units, or in terms 
of the corresponding Greek letters, having an interpretation in reference 
to moments ; and it will be understood that the symbol A may be 
replaced by e°- and the symbol a by log A in the course of any process, 
if it is desirable to do so, the expression still being called the ' array.' 
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In the end an array may be expressed in character symbols in the 
form f(A) which when expanded in integral or fractional or differential 
powers of ^4 arra3's the frequencies ; or, it may be expressed in moment 
symbols, in the form <^ (a), which, when expanded in positive integral 
powers of a, arrays the positive integral moments of the distribution. 

4. When arrays are expressed in the moment symbols, propositions 
(14), (15) and (20) may be replaced by (-25) and (26) : 

Change of origin to m units in advance is effected by multiplication 
of the array by e'""^, whilst the substitution, sa = s'a, effects a change 
of unit, t^ units of the character A having as equivalent s units of the 
character ^4 '. (25) 

Suppression of symbols to give the moment arrays of the larger 
classes is effected by writing them zero. For example, if <^ (a, /?, y) is a 
moment array, expressing, by expansion, the moments in the characters 
^4, B, C, suppression of A 7> by writing P = 0, 7 = 0, gives <^(a, 0, 0), 
the moment array of the character A. (26) 

5. Linear transformations of the variates are effected by linear 
transformations of the moment symbols. For, using dashed letters for 
the new measures and symbols, 

A"'B'«'C'''...=A'^B'C'..., 

.'. xa! + y'P + :'y + ...=a;a.+yli-t- zy + ... 
for all values oi ,r, y, z .... 

Hence if the transformation is to be 

X = c^T,x + c-^y + ..., 
y' = c^x + c.,.y+ ..., 



this is effected by the substitution 

a = Ciia' + C21/3' + ..., 

^ = Ci2a'+C22^'+..., 

(27) 

Having now explained the meaning of the symbols and the purpose 
of their introduction among the counts or frequencies to give full ex- 
pression to the statistical distributions and having indicated some of 
the more elementary properties that flow from their definition, we will 
next illustrate their use by applying them to propositions for the most 
part of a familiar nature. 
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III. BINOMIAL, POISSON, GAUSSIAN, EXPONENTIAL 
AND GAMMA TYPE FREQUENCIES 

1. If the character A is present in the proportion ^ of a population 
and absent in the proportion p' = 1 -p, the population is represented 

by the array 

p'+pA, 

and the number of ^'s found in samples of n drawn at random (with 
replacement) will have the frequency array 

(P'+P^T, (28) 

and hence by (23) the moment array 

/ a^ \™ 

(p' +pe'^Y= [i^+P<^+PK]+ ■■■) =1 +npa.+ .... 

The mean is therefore np and, referred to the mean, the array is, 
by (25), 

= Y- + PP'^+PP'(P'-P)-Q]+PP'(P'^-P'P+P'')JI+ ■■■] • -(29) 

The moments of the binomial (28) about the mean are, therefore, 
the coefficients of a72 !, a^/3l ... when (29) is expanded by the multi- 
nomial theorem. For instance, the second, third and fourth moments are: 
npp', npp' (p' —p) and npp' {p"^ + {3n — 4:)p'p +p^}. ...(30) 

Since moments of low orders only are, as a rule, required, the ex- 
pansion of such expressions as (29) presents no difficulties, and in the 
sequel it will often be deemed sufficient to leave the moment array in 
the unexpanded form, as a useful formula, from which moments of any 
given order may readily be derived. 

2. The Poisson limit to the binomial frequencies is reached by 
supposing the character A to be very scarce in the population and the 
samples taken to be very large in number. If the mean np remains 
finite and its value is denoted by m, the frequency array of the number 
of A'a in samples of n becomes, in the limit, when ^ = 0, w = co , 

{p'+pA)"- = {l+p{A-l)}'^ = e-^^^-'^^ (31) 

This, expanded in powers of A, gives the well-known Poisson fre- 
quencies, 

e-'^(l+mA+j^A^ + ...\ 

and, putting e" for A, gives the- moment array e^l^^-l) about the 
origin. 
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The coefficient of a is m; the mean is therefore m and, multiplying 
by (•""'", we get the array, referred to the mean, in the form 

^m(e«-l-a) = ,,"'W2! + a'/3! + ---), (32) 

whose expansion as e"^°''l^' x e'"" Z^' x ... is easily effected by ordinary 
multiplication, to any required number of terms, eacli term giving a 
moment in accordance with ('i.-l), the '2m\, 3rd ... moments being 

m, m, m + Sm'-, m + lOm^, m + 25m^ + 15;«'* ...(32') 

3. Poisson frequencies are of interest in connection with samples of 
variable number. 

If the population has frequencies p, q ... in the separate compart- 
ments of any classification, it is arrrayed 

U--pA +qB + ..., 
and, if samples of fixed number n are drawn, with replacement, they are 

arrayed 

U" = {pA + qB+ ...r^ 

The numbers in any two compartments A , B are therefore corre- 
lated; in fact, by suppressing the remaining symbols (writing them 
unity), the correlation is arra3fed 

{l+p(A-l) + q(B-\)Y^ 

If, however, 'number in sample' varies, with Poisson frequencies, 
about a mean m and has, therefore, the array 

gm(S-l| 

then, by (19), by putting Cfor <S', the samples have array 

Qj. gmljiA+qB+...-\) 

or g».pM-i)_g7n«(B-i) (33^ 

Thus we obtain the interesting result that if samples are drawn, with 
replacement, from a population, enumerated under any simple or com- 
plex headings, with Poisson frequencies about a mean m, the numbers 
in each separate cell of the classification vary, in complete independence 
of one another, with Poisson frequencies about their individual mean 
values t. 

* These moments are calculated by 'Student,' Biometrika, vol. v. (1906-7), 
p. 353. 

t This result has been pointed out before. See a reference in Pearson's 'On the 
Theories of Multiple and Partial Contingency,' Biometrika, vol. xi. (1915-17), p. 149. 

By means of it a simple proof of Pearson's expression for the probability that a 
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4. The simple Gaussian, or normal, distribution may next, for the 
sake of further illustration, be derived as a second limiting case of the 
binomial frequencies, although the multiple Gaussian will, in a sub- 
sequent section, be derived more generally. 

To obtain the transformation, the number n of the sample is again 
supposed to approach infinite values, but now it is the magnitude of the 
element A in (28) that becomes infinitesimal, the standard deviation, 
which by (30) has symbol A^^PP', remaining finite. 

Let us then, in accordance with (15) or (25), change the unit from 
A to B, writing A'^iW = B'^, or Jnpp a^^li, in (29), representing the 
binomial (28) referred to its mean. This is now, therefore, 

/ l^ 1 p'-po-^jS;^ 1 p"'~p'p+p '<T^fi* Y 



or 






where q stands for (^;' -^)/2 Jp]}'. 

given set of deviations from expected values shall arise in the cell numbers in a 
random draw of M from any universal, classed in n cells, is reached. 

For suppose that in place of drawing a fixed number M we draw variable sample 
numbers, with Poisson frequencies about a mean number 31. Then, by what is here 
shown, the deviation e in any cell number varies independently of the other cell 
number deviations, with Poisson errors about the cell mean in. And by (32') the 
standard deviation of e is ^m. Now it is known that the Poisson frequencies approxi- 
mate to a Gaussian law if the mean m is not too small. Hence, with this proviso, 
the error of the cell number has Gaussian deviate x = el Jm and probability 

e-^^\lxlj2^\ 
The probability of the set of n deviations is, therefore, 

and, if this is expressed as an element in n dimensioned space with tensor x and 
polar angle u, it is 

(ljsj^)'^e-i^\"-'^dxd,^. 
Of these extended probabilities we require those only (divided by their integral) 
for which the total of the cell numbers is iV, exactly, or for which Se = or 
2 Jm .x = 0. This plane will cut the n dimensioned spheres in a series of n — 1 
dimensioned spheres and, if w' is the polar angle in ii - 1 dimensioned space, the 
probability of the draw is 

and the probability rank of the draw is 



/: 



\-ix\n-2a^l[ ^-i,^^„-2,,^_ 



where x^=Se2/m. [See PhiJ. Mag. vol. ... (5th series) (1900), p. 157.] 
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(34), then, shows in array form and as a series in powers of \jjn the 
binomial frequencies referred to the mean as origin of measurement 
and referred to any unit of measurement, the s.d. of the distribution 
having the measure o- in that unit. 

Lea\'ing for the present the interpretation of the succeeding ap- 
proximations*, we have as the first approximation to the binomial 
frequencies, when n is large, the array 

Now, the resolution for/(a;) of the identity 



J —CO 



or of g i <r= (log Bf^l f (.r) 5 ■■ dx, 



in other words, the expansion of a function in powers of its argument 
by infinitesimal increments is the solution of an integral equation. 
In the present instance it is easily seenf that the solution is 

1 _i5^ 
f{x) = — e 2^. 

v27r(r 

and, accordingly, that the limit of the binomial frequencies, when n is 
infinite and the step ^4 is infinitesimal, is the normal curve of error. 

5. It follows from the last paragraph that the Gaussian is very 
simply expressed as an array and this simplicity is a feature generally 
characterising the method of description employing symbols. 

The expression e^'^"'^^ (35) 

completely describes the normal distribution and not only gives a ready 
means of obtaining the moments, by expansion in powers of a, but is 
an ensemble that may be handled as a unit, in any process, for the 
derivation of further distributions. 

To take an instance, samples of n drawn from (35), by (16), have 
their totals arrayed {e^'^^'^Y, and their means (by the substitution 

na = P) arrayed e^ m . The means, therefore, of samples of n drawn 
from an infinite normal universal, have normal distribution also, the 
S.D. being reduced to l,'v''«th of the original value. 

6. In precisely the same way the correlated Gaussian may be derived 
as the limit reached when samples of very large number n are drawn 

* See p. 24 seq. t Or it will follow from (105), p. 46. 

s. 2 
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from a population in which are distributed, but not independently, two 
unit characters, B and G, and the total B and total G are observed in 
each sample. 

If the frequencies of B and woi-B are p and p B 

andof Candnot-Careg'and9''and«,^», c, (^are the a h q' 

frequencies of the conjunctions wot-B not-C, B not-C 

etc., as shown in the figure, the frequency array of C' c d q 

the universal is p' p ^ 

a + bB + cG + dBC, 

or l+p{B-\)+q{C-l) + d{B-l){C-l), 

or l+p{P + \^'' + ...) + q{y + h'+---) + d{Py +■■■)' 

and, referred to the means p and q by multiplication by e"^^, «-«>■, this 

becomes 

1 + Ipp'^^ + I ggV + (<^ -P9) Py + liigber orders (36) 

If this is raised to the power n, it will give us the moment array of 
the totals in samples of n. 

The s.D.'s are therefore Jnpp', Jnqq and changing the units, by 
writing Jnpp'P^^r-.ji', sJnqq'y = (T^y, to show arrays with S.D.'s equal 
to <Ti and 0-2, the array of samples of n is 

fl -(--{i 0-1^/3" + 4 cr,^y'' + »-'^i^2/3VKi {cubes} + ...T, ...(37) 

where r = {d -pq)l'Jpp'qq'. 

In the limit, therefore, when n is very large, the array is 

gio-i^/S'^ + irr^V' + rffiff^lSV (38) 

This is readily seen to be the value of 

-oo 27r<rjO-2\/l — r 

and the same moment array (38) is therefore reached whether as the 
limit of the totals in large samples drawn from a universal having a con- 
tingent relation between two unit characters discerned in the' members, 
or as the array expression of the Gaussian. 

It will be observed that, when samples are drawn from such a uni- 
versal, the correlation coefficient remains the same, whatever the numbers 
drawn, and is equal to the correlation coefficient of the original popula- 
tion, vi:;. {ad - hc)\'Jpp'qq. 
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7. A more general derivation will be sought for the multiple (laussian, 
than b}"- regarding it as the limit of contingent binomial distributions, 
but its array expression can be indicated here, and is clearly 

l,i<^lW + 4"'2-='2- + • ■ • + '•l5<ril^2'«l<'2 +---^ (39) 

extended to any number of variates; ctj, o-^ ... being the variabilities of 
the variates and ;-,2 . . . the coefficients of correlation between them. 

For in the first place the form, that is, the quadratic index, is a 
necessary consequence of the integration of an expression such as 



/:/: 



... A-e~(i™''™'*'<' '"•'''■ ■'-•■■■' g»i''i+^2'>3+-- da\dir«- 



whilst in the second place the constant multipliers in the index must 
be such as to give the second and product moments as coefficients of 
the S3rmbols when (39) is expanded as a moment array. 

Hence (39) represents the general correlated Gaussian frequencies 
referred to the means. If expanded in integral powers of aj, 03 . . . it gives 
as the coefficient of ai''a/.../?-! si... ther, s... moment. If ai=log^li etc. 
and it is expanded in powers of .A,, ^4^ .., by infinitesimal increments, 
from — X to 3c , it gives as the coefficient of Af\ Af-^ ... the normal 
frequencies. 

If the means, instead of being at the origin, are at mi, m^ ..., the 
general Gaussian, by (25), has the array 

g%ai + 77i2a2+ ... +^(ri%^ + i:ro-a;2+ ... +1\2(riO-2<'.ia:2+ .■■ /^Q\ 

8. "We may now enquire under what conditions the general Gaussian, 
as represented in array form in (40), will be the law of distribution of 
the observed effects, when these are brought about as the result of an 
infinity of small independent contributory causes. 

If ^1, A,... are symbols in designation of the observed effects, and 
J\(Ai, A2 ...) array the frequencies of the amounts of Ai, A^ ... con- 
tributed by one, the sth, cause, tlie joint contributions of all the causes 
are, by (17), arrayed 

U fs{AuA.....). 

8 

Let the expansion oifsiA,, A-i ...) in moment symbols be 
/«(e"', e»2...)= 1 + «Ziai+ n/.a.. + ... + ^ o-j^aj^ + lo-jV- + ... 

SO that mi, m,_-.- are the means, <r^, a} ...Vi^uiT^ ■■■ the second and 
product moments from the means, fieri, y8'o'i"<^2 ■■• the higher moments 
of the distribution of effects peculiar to this contributor. 

2—2 
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Then, if the individual means nii, m^... and standard deviations 
o-j, 0-2... are infinitesimal and the number of contributors is infinite, 
but so that the sums [wj, \m^ ... [o-j^], [o-g^] ... are finite, in the limit, 

TLfAA-^ J^2...) = g['»i]ai + ['«2>2+--- + 4[<^i^«i^+---+[''a2'^i<f2]''A+---, 
s 

the general Gaussian. 

The assumption that has been made in drawing the above conclusion 
is that the higher moment ratios p,ji'..., obtained by dividing the higher 
moments of each contributory distribution of fortuities by the corre- 
sponding powers of their standard deviations, are all finite. This, at any 
rate, appears to be a sufficient condition, in addition to the conditions 
named above, that may be supposed to govern the fortuities of the inde- 
pendent constituents, to ensure a resulting Gaussian law of distribution. 

9. We have shown how the array expression for the Gaussian distri- 
bution may be obtained by integration of the frequency curves and 
surfaces. Sometimes the reverse process is desirable and, in illustration, 
we will derive some properties of the tri-variate Gaussian and solve a 
problem in selection by expanding (in part) the frequency array. 

The expansion of the tri-variate Gaussian, referred to means as origin 
and s.D.'s as units, with respect to variate At_, is performed as follows : 

= gi {% + '■i2«2 + '•is^s)^ X e* (-^ " '■12^) ctg^ + 1 (1 - rijS) ctgS + (7-23 - r^^r^^) a^a^ 

= r l_J-eK+'-i2'^+'-i8%)%^^^x the same (41) 

J-«> J2ir 

V27r 

X g''i2^i«2 + '■13^1% + i (1 - 'la") "2^ + 4 (1 - 'm^) 'h? + ('■23 - ''la^'is) <^2% 

and, in this form, the distribution is shown resolved into a normal 
frequency array of a^i's and each a^i array distributed as a partial normal 
array of x^jXs with the well-known means t\2Xi, r^gXi, standard devia- 
tions Vl - J\2^, Vl -ri3^ and correlation coefficient 

'r2s~r^ 2 rn 

Problems in selection or 'changing the frequencies' may be worked 
out by expanding the array with respect to the frequencies to be changed 
and, after the change or selection has been made, reintegrating into 
array form, the remaining variates being retained in this form throughout. 
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Thus, suppose that the variate A^ in the above distribution is 
'selected' to show a normal distribution, with mean h and s.D. s, in 
place of a mean o and s.d. unity. The distribution has been expanded 
with respect to this variate in (41) and, now, modifying the frequencies 
in accordance with the desired selection, (41) becomes 

-^^^ e ("i + '"15'^^' + ns'^i) ^1 dj; 

.'-co V2-.-; 

X f> i (1 - r,J) 02^ + 4 (1 - 7-13=) ag^ + (r„3 - iur^,)cu,a3 

showing that ^4^;, A^ have means, s.d.'s and correlation coefficients 
changed from 0, 0, 1, 1, ^23 to 

riih, >\,h, Vl - ri2^ (1 - r), Vl - rj (1 - r), 

[r^ - r,,rrz (1 - s')}Hl-r,i (1 - s') Vl^- r^^l - •'=)*■ 

10. Gauman rJericatices. It has been shown by Pearson that the 
quadrature of the correlated Gaussian, between given x and y planes, 
may be effected by expanding the frequencies in powers of the corre- 
lation coefficient r. 

This expansion is readily arrived at when S3Tnbols are used and an 
interesting relation is shown. 

When the two characters ^4 , B are referred to their means as origin 
and standard deviations as units of measurement the normal array, by 
(3s„ is 

and its expansion in powers of r is 

= 2 aVi-^'.^vi^'.^, (4-2) 

s=0 S\ 

showing the coefficients as the products of two factors, one an array in 
A and the other in B. 
The array, 

is of interest. Clearly the total and first s - 1 moments are zero, since 
the expansion begins with the term a'; and negative frequencies are 

* In his paper 'On the Influence of Natural Selection on the Variability and 
Correlation of Organs,' Phil. Trans, vol. 200a (1903), Pearson develops the general 
formulae and on p. 25 gives this case as an instance. 
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therefore involved. The integral equation is easily solved and the 
resolution into frequencies is 



^*"^-£(-0(7i^)^"^^^' <^^) 



as will be seen by performing s integrations, by parts and finally using 
m, 4. 

The array a'e^"' therefore represents the sth derivative of the normal 
curve of error (with sign adjustment) and is the function considered by 
Thiele* who tables the polynomials and their zeros. 

It follows then that the identity, (42), between integrated arrays 
has, as its expanded equivalent, 

£ L ^^'^^-dy=l £ {-p^^-^''- ■ £ (-|)^-^^^^ • .1' 

^here z= ^ .-2-(i^(^'+2/^-2"^), 

27rVl-r^ 

Zx = —=6~i'''' and z,j=--=e~^y\ 

\/27r \/2rr 

and it follows that 

==j.(-i)'-(-|y-.n *"> 

and that 

.£ £ "'^"^^ = Jo (- 0" '■■■' • (- tT '^ ■ ?! '■ 

The coefficients of r* in the expression of the integral, from given .r, y 
planes, of the correlated Gaussian, as a series of powers of the corre- 
lation coefficient r, were found by Pearson and tabulated by Everitt J, 
under the name of Tetrachoric Functions, which are thus seen to be 
related with Thiele's Gaussian derivatives. 

The extension of the series expression for the ordinate to Gaussians 
with more than two variates is easily made and the expansion of the 
tri-variate Gaussian array, 

* Almindelig lagttagelseslaere : Sandsynligheds regning og Mindste Evadraters 
Methode, T. N. Thiele, 1889. Thiele omits the l/J^. 

+ In the first term, — -« will stand for / i^dx. 

d / _oo 

J Phil. Transi A, vol. 195 (1903), p. 1 and Biometrilia, vol. vii, 1910. Here 

§ The sign X abbreviates ' to the power of.' 
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in powers of ;•„, i\.s, r.,_, leads to the result 

i~oj^ok=o\ d.rj -'A (/jj ''A cLJ ~'-^i] J\ /■!' 

(45) 

where r now stands for the ordinate of the general tri-variate Gaussian 
distribution referred to the means as origin and standard deviations as 
units of measurement and z^ -j, -s as before stand for the ordinates of 

the simple Gaussian totals, --p= e~ ^a^i^ etc. 

11. From the expansion of the ordinate of the general Gaussian in 
powers of the correlation coefficients, as shown in (44) or (45), not only 
may the integral of the frequencies, between given variate limits, be 
found, but the successive integrals or moments* for such blocks of the 
solid may also, from the form of the resolution, be readily calculated. 

The calculation of such a part-coefficient as 1 ^'' (--t-) -xdx may 

be performed, either by obtaining the polynomial multiplier of z^ in the 
integrand and using Dr Lee's table of the values of the incomplete 
normal moment function t, or by integrating by parts, when there results 
a series of tetrachoric functions multiplied by powers of x, followed 
(lip $ s) by a single moment function of order p - s. 

12. iloreover the expansion (44) may be applied to extending 
Pearson's equation for r % from tetrachoric to polychoric groupings. If 
the classification gives more than a simple dichotomy of each of the 
two variates the problem becomes one of probability. Assuming a normal 
distribution of the marginal totals, without errors, and thence a know- 
ledge of the values of the class divisions on the normal scale, no assumed 
value of r will, in general, give all the observed cell numbers. 

We will find the probability of the observed numbers for any assumed 
;• on the supposition that the errors are due to random sampling and, 
by making this a maximum, obtain an equation for the most putable§ r. 

*' We may call to mind that the nth moment about one end of a frequency 
distribution divided by n\ is equal (by integrating n times by parts) to the n + lth 
integral from the other end. 

t Tables for StatisticiaiJS and Biometricians, Tahle IX. * Loc. cit. 

§ There is no most-probable r, since the sampling is from an unknown universal ; 
and, for the same reason, there are no probable errors to the calculated value of r. 
Theoretically we could calculate the ptitable errors, defining the range of putable 
error as that within which the universal must lie if the probability rank of the 
"sample is to be greater than J. If more than one constant is in question we could 
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Let u, V stand for f — =e-**V« and -=e"i2/ ^f^/, the 
y-»V2ir j-»V25r 

integral frequencies, or ranks, of the probability curve ; and let dashed 

letters indicate derivatives, so that u, v' are the ordinates and - u", - v" 

the first (neg.) derivatives etc. Denote by 8m, 8« the difference of rank 

of adjacent marginal divisions and by Sw 

the integral frequency of the enclosed cell. 

Then, by integrating (44) between 

divisional limits. 



8m 



Sv 



Sw = Su8v + Bu'Sv' .r+i 



'/'Sv"^_+. 



.(46) 



Now, if n is the observed number in the cell, the probability of the 
sample for the given classification is, by (6), 

taken for all cells. 

To make this a maximum requires n(Sw)" a maximum, or 2?ilog(8w) 
a maximum. 



The value of r is therefore given by 

n -Y- (8w) 
S^t— = 0, 



Sw 



<, { Sii'Sv' + Su"Sv" . r + Su"'Sv"' -—.+ ■■■) 



BiiSv + huh) . r + Bu'Bv" 



.(47) 



2! 



13. Returning now to (34), expressing the binomial frequencies 
referred to the mean as origin and a unit of measurement making the 
standard deviation o- and shown as a series in powers of 1/\/m, it is seen 
that the array coefficients are Thiele derivatives and that the binomial 
is equimomental, therefore, with the frequency series 



1 + 



V?!. 3 



d.rj n 



H<^-»l'(4.)'^«'^fi)}-' 




where q = {p' — p)l'i '^pp'. 

calculate the contour of values having the same property. Practically the work 
would be prohibitive. 
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14. The Gaussian a.._. aussian derivatives are curves with infinite 
limits in both directions and a series of such curves can be fitted by 
moments to a distribution of frequencies having a like characteristic. 

Let <^(a) express the given distribution in moment array. Multijjly 
by the expansion of c ~ 4 '^' and let the result be the series 1 + Ci a + (va- + . . . . 
Therefore 

^(a) = (l + fia + f,a-+ ...)(^4"' (48) 

and so the required resolution is effected, in arrays. 

If y =f{x) is the required frequency curve, by (43), the resolution 
in frequencies is 

If the mean of the given distribution is m and standard deviation o-, 
these may conveniently be taken as parameters of the arbitrary Gaussian, 
when the resolution to be performed is expressed, in arrays, 

<^(a) = (l + e3a' + c,a''+...)e™'^ + 4'^'''' (50) 

and, the c's being determined by the same expedient as described above, 
the resolution in frequencies is, by an obvious extension of (43), 

If a limited number, only, of the moments of the distribution are 
given, for instance, if <^ (a) is an empirical series, the c's are calculated 
to the same order and the fitted curves are equimomental to this order. 

If the distribution to be resolved have not infinite limits in both 
directions, other curves may with advantage take the place of the 
Gaussian, the treatment, by derivatives, being similar to what precedes. 
We will next consider curves suited to describe statistical frequencies 
extending to infinity in one direction, only, and their arrays. 

15. Gamma type dhtrihutions and tltelr derivatives. The frequencies 
having logarithmic decrement are expressed by the exponential curve 

y^e-", (52) 

of which the array is 

r g-^e<^'-crjx = -^ (53) 

.h 1 - " 

If random samples of p are taken, the total of the character .r in 
them, by (16), has the array 

(l-°r' 
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'^' T^h^-\7-^'"'^ ^''^ 

and so the Gamma curves 

are derived as the result of sampling the exponential frequencies (52). 
As in the last section we have, now provided s <p, 

<T^-=/:(-0(w)--- <^=> 

and hence that the left-hand member is the moment array of the 
frequencies 

y=[-dx)-npr ^ ^ 

If, then, (^(a) array the moments, up to the sth moment, of any 
frequency distribution having range to « and if we multiply by the 
expansion of (1-a)'', where p is any value >s, and obtain the series 
1 + Cia + Cija^ + . . ., it follows that, to the sth order of moments, 

<l>{o.) = {l + C^a + c^a^+ ...)j{l-o.Y, (57) 

thus resolving the array into a series of Gamma derivatives. 

If y=f{x) is the required frequency curve, the resolution is by (55) 

The resolution of frequency distributions into Gaussian and Gamma 
integrals and their derivatives presents an advantage for purposes of 
quadrature since (49), (51), (58) are clearly integrable with the help of 
existing tables*, the first term only requiring the tables of integrals. 



IV. SAMPLING WITHOUT REPLACEMENT AND PARTITION- 
ING A LIMITED POPULATION. HYPERGEOMETRIC AND 
KINDRED FREQUENCIES 

1. We will next enquire how far the symbols of denomination may 
be brought into use to simulate the fortuities arising when units and 
groups are taken and not replaced. 

Clearly, when samples are drawn en bloc from a limited population, 
the supposition made in (16) that each successive unit contributing to 

* Tables of the incomplete Gamma function are now at press. 



HYPERGEOMETRIC AND KINDRED FREQUENCIES 27 

a sample is drawn from an unchanging population U is no longer tenable 
and f "'" will not correctly array the samples of *■. A different algebraical 
analogue is required, which may be made quite general, and applicable 
to multiple sampling, as follows. 

2. Let C^ comprise / units A,m units B etc. {l+m + ... = n), so that 
in product symbol 

U=A'B"'... 
and let the sampling consist in partitioning U into samples of number 
Si, .«o . . . (si + *, + ...=n) and of designation »S', , S., .... 

Then any unit as ^-1 may fall into any sample and, if the appropriate 
suffix designate its destination, the alternative fortuities of A are 
arrayed 

AiSi + A,S.,+ .... 

Hence all the possible partitions of U are arrayed 

{A,S, + A,S,^ ...y (B,.s\ + B,S,+ ...}•"..., (.39) 

any single term such as A^"' B^''^ . . . .%'k A.^'-B.,"' ... S.f-^... signifying a 
particular partition off" into samples of Si, .v-., ... in number, viz., that 
partition in which sample 1 has r<i ^'s, bi B's... and sample 2 has 
a., A's, &2 B'a... and so on. 

"We require to extract from (59) all the terms in N,''''^'/'---, their 
number (by suppressing the symbols A, B...) being coefficient of 
•S'/i.SV^... in (.s'l + . y, + ...)" or w!/si!s,l... say "Cj^^,^ . 

If, therefore, from r=A'B'"... is taken a first sample of .Si, a second 
sample of s^ and so on, the last sample being the unselected remainder, 
the frequency array of the constitutions of the samples is expressed as 
the partial array, 

Array -S',*' .S'/^ . . . 

:{A,s, + A,.s.,+ ...y{B,s,+B,s,+...r...i"Cs^^,^„,...im) 

3. It will be observed that (59), with the ^"s omitted, merely arrays 
all the possible fortuities of the units by ascribing to their objective 
s)rmbols the distributive law of common algebra. The symbols <S'i , .V^ . . . 
are introduced to play the part of counters of sample numbers and 
enable the cases containing Si in sample 1, -s^ in sample 2 etc. to be 
segregated, as is done in (60). 

We have supposed the universal to be made up of separate kinds 
A, B ... but, it will be clear, that these symbols may stand for complex 
characters. For example, if A =FQ'R', B ^ F(i: Jl' ..., H=P(iB, in- 
dicating that three characters and their negatives are discerned in each 



28 SAMPLING A LIMITED POPULATION. 

individual, then the result will give the distributions of total-P, total-Q, 
total-^ in the samples. It is unnecessary however to elaborate special 
formulae since, by following the elementary principle just noticed, the 
frequency arrays for particular classes of problem may usually be written 
down without great difficulty. 

When the array has been expressed in symbols, it can be analysed, 
or combined, and, in particular, its moments obtained by the methods 
already explained. Four examples follow. 

4. As an elementary instance we will first find, by the present 
methods, the moment array of the hypergeometric frequencies, viz., the 
frequencies of A in samples of s drawn, without replacement, from a 
population of n units of whom I are A^. 

Suppressing the unnecessary not-^ and not-sample symbols, the 
frequency array is, by (60), 

Array S" ■.{l + ,ST-'{l + ASy/''C, (61) 

and the moment array is therefore 

Array S' ■.(1 + S)""-' (1+6-8)'/" Cs, (61') 

Array S':(l + Sf-' Ul + 8)+ (a + ^^+ .. .) >S'}7"C. 

, Is/ a-' \ l{l-l)s{s-l) / o? V ,^„, 

The first three moments about the origin are therefore 

k Is ^ l(l-l)s{s-l) Is ^ ^ l(l-l)s(s-l) ^ l(l-l)(l-2)s(s-l){s~2 ) 
n' n n(n-l) ' n m(re — 1) M(ra-l)(w — 2) ' 

and the rth moment about the origin may be expressed as 

Is l{l-\)s(s-\) m-i){l-2)s{s-\){s-2) ^ 
' n 2!m(ra-l) ' 3 !m (m- 1) (»- 2) ^'^ •■■' 

(62') 

where ^E,. is defined by 

(.»-i)^ = i-i?.^, 

r=l I ■ 

and the series terminates with the rth, Zth or sth term, whicliever is 
least (usually r). 

* The moments of the hypergeometrical frequencies are worked out by Pearson 
in hia paper 'On Certain Properties of the Hypergeometric Series,' Phil. Mag. 
voL L. (5th series) (1900), p. 157. 
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5. As a second illustration we may take the double liypergeometric* 
and find the frequency and moment arrays of the correlated numbers of 
A found in U\o separate samples of Si and Sg drawn from a population 
of « , of whom I are ^-l , without replacement. 

By what precedes, the frequency array will be, on suppressing the 
unrequired symbols, 

Array -S^uSV^= : (1 + S, + ,%)"-' (1 + ^,;S; + ^l,,s;)7"C:,^ , ,, -(63) 
and the moment array 
Array ,S'/= .s','^'-- : (1 + S, + ,S',)"-' (1 + e'^hS, + e'^S^frC,^^ ,^. 

Array S.'^S/^- : (1 + >% + &)""' |(1 + S, + ;S'o) + («i + ^ + •■■) 'S'l 

,fi(|^(a,.j!....y}.etc (64) 

The power moments, m^ , m,.^, about the origin, are the coefficients 

of ^ , a2_ ^^^ ^_^ ^^^ given by the initial and final terms of each chain 

bracket of (64) and are as given in (62) above. The product moments, 

m^ y , are the coefficients of the products -^ -fy and so are given by 

the intermediate terms and 

_ 1(1-1) s^.% l{l-l)(l^) \ sAs,-l) .s, ,„ i„ 
»»ri , ,. - -„ („ _ 1) 1 1 1 ! + n („ - 1) (m - 2) I 2! 1! '•' ''^ 

the series terminating usually in virtue of 'Er vanishing when i > r. 
Here {n -)''' = n{n-l).-- to i factors. 

♦ The moments of the double hypergeometrical series of frequencies are investi- 
gated by IsserUs in his paper ' The Application of the Solid Hypergeometrical Series 
to Frequency Distributions in Space,' Phil. Mag. vol. xxviii. (1914), p. 379. 
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6. As a third example let a population of n individuals be sampled, 

amongst whom two characters B, C are distributed B_ 

in such wise that a possess neither B nor C, b possess 
B only, c G only and d both B and C. The samples q 
of s are therefore expressed as 

Array S':{1 + Sf (1 + BSf (1 + CSf (1 + BCSffCs 
and the moments given by the development of 
Array S' : (1 + S)" (1 + e^S)" (1 + eyS)' (1 + e^+ySfrC^ 

: (l+^)« {(1 + ,S') + (/3+ f! + •■•)4'{(] +'^) + (r + |]+-) 4" 

(l+^) + (^^8T7 + ^J^'+...)^|y''f7, (66) 

Thus, referred to natural origin, the numbers of B and C in the 

samples of s, drawn without replacement, have for their moments 

mean {B) = {b + d) . sjn, 

mean {C) = (c + d). s/n, 

2nd moment (B) = {b + d). sjn + (b + d) {b + d-1) .s{s- l)jn (n - 1), 

2nd moment (C) = {c + d). sjn + {c + d) {c + d — 1) .s{s — l)/n (n - 1), 

product moment = d . s/n + {(b + d) {c + d) - d] . s {s — l)jn (w — 1), 

(67) 

and higher orders, for a time, not difficult to extract. 

7. In the three cases taken above either one or two 'alternative' 
characters are discerned in the individuals composing the limited popu- 
lation from which samples are taken. As a last illustration we will 
suppose measured characters, in any number, distributed with normal* 
or other frequencies, in the n members and obtain both as a partial and 
separate moment array the distribution of totals of the several characters, 
when random samples of s are withdrawn and replaced en bloc, or when 
the whole population is randomly partitioned into groups of given 
numbers. 

\i A, B... are the character symbols, an individual of the population 
possessing measures x, y ... in these characters has the symbol 

/, or A^B"..., or e^'-+v^+-, or e', say, 
where i, standing for a-a + yP + ..., shows his features expressed as a 
sum array. 

* No limited population of number n can truthfully be said to be distributed 
normally. Actually we assume the population to have all the Gaussian moments 
and surmise that, if n moments so agree, the result will not be much in error, for 
our purpose, by the disagreement of the remainder. 
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The array of the population is, therefore, 

-^A^'B"..., or is,", or say <^(a, ^...) (Osi) 

IS'ow in accordance with the methods of this chapter, if Ji' stands 
for 'rejected' and .s' for 'taken in sample' the totals of the characters 
in .samples of s are given as 

Array 7i"'-^s'< ■.n{R + /-S)/"C'., ' 

n repeating for the different values of/, or g^+!'?+--, n in number. 

We are given the moments of .r, y ■■■, viz. the coefficients of the 
moment array, <^ (a, ;8...), of the universal, and must therefore rearrange 
the above expression, as regards the variates ,r, y ■■■, to show only 4> 
coefficients. To do this, virtually we take logs and then expand each 
log (i? + e'S) in powers of i by Taylor's theorem. Actually we abbreviate 
by using the symbolic expression of the theorem. Putting S^e", we 
have a function of i + o- and write symbolically 

d 

Proceeding thus, the totals have expressions 
i?"-,S'^ f IS log (^ + ?'+=■) 



Array 



n — s ! .? ! n\ 

d 



eX^e i'^ log {R + «"■) 



d 



[by (C.si)] 



'^'"^{°-d^-' ^,7^---)log(^ + «') 






....(68ii) 



Hence (68 ii) shows the distribution of totals, for single sampling, 
as a partial array. To extract the array, put <^ = 1 + ^i, so that <^i (a, ^ . . .) 
stands for xa + y/3 + . . . + {x'a? + -ixy a/3 + ■fP'^ + . . . ) 2 ! + . . . in accordance 
with (24) or (68 i). Then the sample totals are 

R"-\S' (R + Sy ^ ( Sf] „Sd \, ,„ „ , .... 

Now if the ^i functional operator is expanded in powers of a, p ... 
and we get a coefficent (t^) log (J? + S), this is a series of powers of 
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-^ — ^, commencing with the first power. If we then expand the ex- 
ponential, we get a series of powers of ^ — ^-, preceded by unity. But 

where (s -)■' stands for s{s-l) ... j factors. Thus the partial array is 

s— 8 

given simply by putting — for ^ — ^ in the final expansion of the ex- 
ponential. 

The result can be expressed symbolically. For let T = -^ — s . 

.-. {f^\og{R + 8)=(T{l-T)^){-\og{\-T)}, (r>0) 

and, making the substitution, the expression (68iii) emerges, as ex- 
plained, as a function of T which, in the final power series, must be 

s ^ 
replaced by — , the powers of this having the meanings given. 

Hence, if from a population of n having any measured characters 
ivhose array is 

<i>ia,/3...) 

are drawn whole samples of s, the totals of the characters in a sample, 
on repetition of the experiment, have array 

n\^^\^aT(l-T)~, /3r (1- T) ^ ...]-l{- {-log (1 - T)} 

when this is expanded in powers of T and, in the end, T^ is replaced hy 
s{s-l) ... jfactorsjn(n~ 1) ...j factors. (68 v) 



To assist in the calcalation of such expressions as (68 ii), (68 v), we 
find that, if 

■8/ 



fe)''i°g(^^'-^)=*^:ra^^^(i?Tr> -^••■ 



' T{\-T)-^\-\og{l-T)] = hT+hT'+ . 



then the coeflScients Aj, k^... run as follows for r = 1; 2 ; 3; ... viz. 

1; 1-1; 1-3 + 2; 1-7 + 12-6; "l 

1-15 + 50-60 + 24; 1 - 31 + 180 - 390 + 360 - 120; ...J " "'^ ' 

In the special case of the population having a normal moment array, 
with zero means, by (39), 
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aud the moment array of total characters in samples of s is therefore 

As an example of the last case, if there is one character only, 
normally distributed, with s.D. o-, in the population of n individuals, 
and we require the moments of the total measures in samples of s, 
these have moment array, by (68vii), 

Oj._ gn{(I-T^)i,,V + (T-rr^ + 12T^-6T^)i,,'a'+...} by (68vi), 

or, l+n{T-r-)^<TV 

+ {n{T-lT'+12r~6T') + ir{T--2T'+ T*)] ^a'a* + ... ; 
2nd mt. s s (s - 1) 
a- n n{n-\) 

l?^=3»t.3«(»-„(^)'-e»,»-«)(iz)%3»(»-6)(^J. 

(68viii) 

If, lastly, instead of taking a sample, we randomly partition the 
population of « into groups of given numbers «i, S2 ... .<„ ..., by (60), in 
which now l = m= ... = 1, the totals in the several groups (distinguished 
by suffixes 1,2 ... g ...) are 



Array 



«i ! .So 1 . . . ' w ! 

el21og(e'i+<"i + e'2+'^=+...) 



w! 
the n and 2, as before, repeating, for every individual, the symbol 
meanings Z, = 4/^/ ..., ig = a:ag+y/3„+ .... 

* The value of B.^, or 4th mt./(2nd mt.)2, deduced, accords with the value 
obtained by Isserlis in his paper ' On the Value of a Mean as calculated from a 
Sample,' -Jo. Roy. Statistical Soc. vol. 81 (1918), p. 78. 

The reduction of such expressions as (68 viii) is facilitated by observing that the 

expansion symbolised by ( 1 ] must, by (68 iv), be 

Rn-Bss (R + S)" f S \' {R + S)'-JRi _ fn -s- \i 

For instance, 

s (s - 1) _(n- s) (n-s- 1) 



1-2- + 



n n(m-l) «(n-l) 
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Proceeding as before, we expand in powers of ti , t2 • ■ • by Taylor's 
theorem in its symbolic statement and get 

A A 

n\ 
^T-^'^("^^^°-i^+-- ^^^/^-^/-' ■■■)log(^^'+^'^-+-) 

[as is seen from the identity (68i), viz. Sg^°''+2/^' + --- = n<f>{a', /3'...), by 
putting tti^— + a2^— + ... for a', /?i^— + Sg t— + •■• for S' etc.l 

_ , / Sid Sid „ Sid „ 5*2^ \, ,„ ~ , 

iSi + S,+ . 



n\ 



eln {<!> (ditto) - 1} log {S\ + S„ +...). 



The function </> — 1 contains first and higher powers, only, of the 
operators and the exponent of e contains therefore only powers and 

products of ^ — ^^ , ^^ — ~ ... and the exponential expansion 

contains such terms preceded by unity. 
But, by the multinomial theorem. 

Array , , 

Sils,\... 

(s,+s,+...r ( s\ V / s, Y _ (si-Hfe-)^.. 

m! \Si + Si+...) Ui + ^2+.../ ■■■ (?i-)-*+*^- ■ 

Hence the required partial array, that is, the moment array of total 

characters for the specified partition Si, s.,-.. is given by writing ^— for 
^ etc. in this exponential expansion. 



Si + S,+ ... 

Hence, if a ^wpulation of n, having any measured characters distri- 
buted as the array 

<^(a,/3...), 

is randomly ^partitioned into groups {suffixes 1, 2 ...) of given numbers 
Si, §2 ••• , the totals of the characters to he found in the groups, in repeated 
partitioning s, have array obtained by expanding 



I / S,d S„d „ S,d „ S«d \ ] 



■) 
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in poHvrs and products of 

<S'i + (So + . . . ' )S\ ^ ,s', + ... ' ■ ■ ' 
and puttinij for these 

n-' »- ■■■' 
and putting, finallij, 

{si —y = Si{si- 1) ...y factors etc (68 ix) 

In accordance with (-'6) the result for single sampling (our first case) 
is obtained by writing aj, a^ ... (in... etc. zero, leaving an array in 
1*1, /Jj..., and that for double sampling in any given numbers is 
obtained by writing all the symbols zero but those with suffix 1 or 2. 

If the moments of means are needed, in place of those of totals, in 
accordance with (25) we write 

tti/Si, P\ls.^ ... a,,/.s\ ... etc. 

for Oj, /?!... ao ... etc. 

If it is required to find the various power and product moments, in 
sampling, of the higher moments, vice the means of the simple measures, 
of characters, the formulae will give these, since powers of measures maj- 
be taken as the characters considered. Arrays are very simply referred 
to means, viz. by suppressing the linear terms in their exponential 
expression [see (39), (40) e.g.]. 

Looking at the kind of result recorded in the problems dealt with in 
this chapter the difficulties are realized that are likely to be encountered 
in attempting to obtain the terms contributing to the higher moments 
of this class of distributions in any piecemeal way instead of as a 
whole. 



V. GEOMETRICAL DISTRIBUTIONS. SAMPLES OF VECTORS. 
RANDOM MIGRATION 

1. The symbols A, B, C ■.., which, hitherto, have symbolised unit 
characters generally, may as a special case stand for unit steps or 
vectors in given directions. JC, Y, Z may conveniently signify unit 
vectors parallel to the coordinate axes and, then, according to pre.sent 
methods, X-'- will present a step, parallel to the axis of .r, of x units 
and X'^'Y^Z'' will present the point whose coordinates are x, y, z 
relative to the origin. 

3—2 
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The frequency array 

will present an assemblage of points, steps, or vectors, Px.y.z being the 
probability or frequency of representative points at x, y, z and 

will array the total steps or resultant vectors when samples of n are 
drawn from V and added. 

As before, the Greek letters f , v, t will stand for log X, log Y, log Z 
in the algebraical transformations which counterfeit the selective pro- 
cesses. 

2. If lines and geometrical extensions may be looked upon as 
aggregates of closely and equally spaced points, they may be shown as 
frequency arrays by single expressions. 

Thus the line between x = —l and a; = 1 is expressed as 

/■I -^r,dx fi .dx sinh^ , > 

From this we infer that the means of randomly chosen sets of n points 
in the line has moment array 



(^T («»^> 



whence by taking the coefficients of 

2! ' 4! ' 6! ' 
in the expansion, the 2nd, 4th and 6th moments are 

^, {| + i(«-l)}/< {i+(„_i) + |(„_i)(„_2)}/«=. ...(70) 

3. Another use to which line arrays may be put is in the quadrature 
or smoothing of arrays. For, if 4> (i) is the array representation of any 
frequency curve, 

, , .- sinha^ 
^(^)>< -^' (71) 

by (17), will array the result of taking an ordinate of the curve with a 
point on the line. 

(71) therefore arrays the means of frequencies extending to the 
distance a on either side of the ordinate. 
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1 liA = A''Y"Z', 5 = A'^^yZ- are vectors, J5 is the resultant 
and, if a, fi are defined by A =ef, B = e'', we may say that the resultant 
of a and /3 is a + ;S. 

Thus the moment symbols have the characteristics of the vector 
symbols of the usual theory. Also 

i, V, i being the moment symbols of unit vectors along the axes, 
defined by A' =t*', Y=0^, Z=e'=. 

5. The limited line joining two points A, 5 is expressed in array 
form as 



/. 



%.«o.-.i,,, „ ^, „. ^J „,) 



/o P-o.' "' log5/A 

By a similarly constructed integral it may be shown that the space 
of the triangle A, B, C has as its expression 

2("' 2(^3 2ev 



.(73) 



(a-^)(a-y) (^ - a) (/3 - y) (y_a)(y-^)- 

Since the three middle points of the sides have array 
l^i(a + ^) + |f,i{a + 7) + i^eW + y) 

and both expressions have expansion beginning 

1 + J (a + /3 + y) + Jj (a^ + ^i! + / + a^ + ay + /37) + . . . , 

it follows that the first and second order moments are the same for the 
triangular area and for the three mid-points of the sides. 

It will be clear that the points in the lines and geometrical extensions 
may, on occasion, be regarded also as being unequally spaced or as having 
distributions of density. 

Thus if y ^- /3 in the above triangular area we have, in the limit, 
a line of points whose density increases uniformly from zero at the end a. 
Such a limited line, then, will have as its array 

{o-'PT ^ ' 

It may here be recalled to mind that all our arrays represent distri- 
butions of frequency, the total frequency being unity, or occasionally 
zero. There is therefore no question of the absolute number of points. 
In the actual universal it is generally infinite : in the frequency array 
it is generally one in number. 

6. Polar symbols in two dimensional distrilnitions. When distribu- 
tions of points in a plane are given in polar coordinates r, d, their arrays 
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may be expressed in terms of two symbols t, v, defined as connected 
with the symbols i, r/ by tbe relations 

| = TC0SV, rj = TSiav, ^^ + -rf=T'^- 

The symbol for tbe point x, y is, therefore, 

X^Y\ or e^f+^Z", or /^ <=os (v - e)^ 
and an array of points will be expressed as 

\F{r,e)i'"'°^^''-^^d/rdQ, (75) 



//' 



taken between definite limits, F{r, 6) drdO being tbe frequency of points 
at r, 6, wbose integral between the same, given, limits is generally 
unity. 

7. Circular distributions. Vector sampling ivheii all directions in 
the plane are equally probable. In polar symbols, therefore, a circle of 
radius r from the origin is expressed as 

/■''I grrCOs(.-e)^^_ or /„(?-r)* 

and a distribution of points in circles about the origin as 

'i>{r)=r.f(r)I„{rT)dr (76) 

Jo 

In a great many problems presented in physics we require the 
fortuities resulting from taking samples of vectors, in known numbers, 
and compounding them. The tensors are sometimes given, but usually 
they are fortuitous, with given frequencies, and the directions, in these 
problems, are supposed purely random, that is to say, all equally 
probable. 

The universals are therefore equivalent to distributions of points in 

* We may define the Bessel function with imaginary argument and obtain its 
expansion thus : 

In W \_=Jn{i^)li"] =1^" cos n^ . e^cos.1, ^ 

f^" , I . .of \d^ 

= 1 cosmi/' ... + cos*vI'-;+... I ;r^ 

/"Stt / slicosnip X' \d\l/ , 

i ' fin+9 „ 1 1 I T I "^ '" "t" (i 



"a^re! 2''+2m + I!ir ■•■^2"+2'-n + r!)-!~ 



and we may deduce in like manner that I sin ni// . e* ™^ ''' ^ = 0. 

Jo ^'^ 
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circles, expressed as (7G), and the methods of arraying samples drawn 
from uuiversals, described in ii, 2, enable us to write down immediately 
the distribution of the resultant vector as an arraj-, i/f (t), in the tensor 
symbol t. 

This represents another distribution of points in circles and the 
frequencies of the tensor r are therefore, by (76), to be found by the 
solution for h (r) of the integral equation 

Jo 
The solution, by Hankel's formula, is 



'.{r)=l r.rJ„(r.v)^(Lr)d.r (77) 



As an example, if the conditions of the problem state that one step 
of length li is to be taken from a starting point, in random direction, 
followed by a second of length L, also random, and so on, to n steps, 
this is the same as taking one sample from the circle I„(Iit), one from 
Io{Lt), and so on, and therefore, by (17), 

<i'{r) = h{kr) /„(4t).../„(4t) 

will array the resultant step, viz. that obtained by compounding the 
sums of the characters X, Y to be found in the samples. 

We therefore, in this case, reach Klu3rver's* formula for frequency 
of final excursion 



h(r)=\ r.rJ„{r.v)Jo{kx)...J„(l„x)dx (78 

Jd 



If, on the other hand, as is more usual in molecular and other 
physical problems, tensors are fortuitous, with given frequencies F(r)dr, 
we are now taking n samples from 






F{r)I,{rT)3r, 
Jo 

and these, by (16), will have their resultants arrayed as this expression 

raised to the power n. 

Hence, in this instance, the frequency of resultant tensor length is 

h(r)=l rxJ,{rx)U F{r)J,{rx)dX dx (79) 

* For references to Kluyver's paper (1905) see Pearson and Blakeman, 'A Mathe- 
matical Theory of Random Migration,' Drapers' Company Research Memoirs, Bio- 
metrio Series in (1906), and Eayleigh, 'On the Problem of Random Vibrations and 
of Random Flights in One, Two, or Three Dimensions,' Phil. Mag. Vol. xxxvii 
(1919), p. 329. 
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8. Moments of tensors of circular distributions. It was shown in 
II, 3, that the array of a simple character, A , when expanded in powers 
of the symbol a, gave the sth moment of the frequencies as the coefficient 
of a7s!, or, that 

, a? 

4>{a) = l +OT.ia + »J2— ; + .... 

It appears from (76), by expanding /„ (rr), that an array of points 
in concentric circles, when expanded in powers of the tensor symbol t, 
gives the 2sth moment of the tensor as the coefficient of t^/2''s ! s ! , 
or, that 

<^« = l+'«2^S^-'«4 2J^ + -- ■ (80) 

Thus tensor arrays are moment arrays, although in a sense that 
needs to be restated. 

As an example let us find the moments of the resultant tensor in 
terms of the moments of the universal of tensors from which samples of 
n are taken and compounded at random angles. 

Suppose, for instance, that ' free paths ' are considered, liable to such 
fortuities that the moments of the frequencies of length of path are 
OTa, mi-.- and that forward and backward and all directions of path are 
equally probable. Then (80) is the universal of ' free paths.' 

If n such paths are taken, in sequence, randomly chosen as to length 
and direction, the chances of final distance reached from the starting 
point will have another distribution i^ir); and if J/,, il/, ... are the 
moments of this distribution, by (80), 

But, by (16), ^(r) = {<^(T)}''- 

Hence the moments of the final transference, after n paths have been 
traversed, are related to the moments of the probabihty curve of length 
of path by the equations 

3/2 = nm^, 

Mi = nnii + 2h (w — 1) m.?, 

Ms '= nm^ + 9n (n—l) m^nii + 6w (n — l) {n — 2) m.?, 

Ms = wws + 2« (ra — 1 ) (9?7z/ + 8mo m^) 

+ 12n {n - 1) (« - 2) mimi 

+ 24?z(n-l) (m-2)(w-3)OT2^ , (81) 

Thus, where the solution, (79), of the problem of vector sampling 
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presents difficulties, either by reason oi F{r) being observational, or, if 
a known function, presenting difficulties in integration, the solution 
(SI), b}' moments, will sometimes enable approximate curves to be 
fitted. 

Before dealing with as3'mmetric distributions of vectors and the 
chances of resultant tensor and direction in taking random samples 
therefrom, a few propositions are here given to illustrate how arrays 
may give new interpretations to results and means of reaching them. 

9. The si/mmetr/cal Oni/ssiai). If a; y vary independently, with 
normal errors, the s.d.'s being unity, the array of points, by (35), is 

1 12 1 _2 1 -.2 

(,ts X e*'' or ('-"^ . 
But the distribution is circular, the frequency of point being 

or ~e-i''\rdrdd, 

that is re^'^^'dr in the zone dr. 
Hence, by (76), 

e'--''=\^re-i'"-I,{rT)dr, (82) 

which is a form of Weber's theorem. 

10. Circular area. An even distribution of points within a circle of 
radius a from the origin, by (76), has array 



I 



r , , 'Irdr 2/i(rtT) , ^ 

h{rr)^ or — iA--' 83) 

a- (IT ^ ' 



11. Uneven distriOution of points on a circle. If the points 
^(iT COS (v- 8} 

on a circle of radius a are distributed with the harmonic density 

cos {s6 + yg) — 
the array is 

/ '"cos {se -r y,) e""- ""^ (" - «) 1^ or I, On) cos (sv + y.). . . . (8 4) 

Jo -i'T 

Thus any uneven distribution of points in a circle can, by Fourier's 
theorem, be shown as the sum of harmonic circular arrays such as (84). 
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12. Single points. In particular the point («, 6), whose symbol is 

-ar COS (v - 8) 

is, by the Fourier expansion, expressible as 

/„(aT) + 2 2 I,{ar)coss{v-e) (85) 

s=l 

In like manner the point {b, + C) is expressible as 

I, {bT)'+2^Is{bT) cos s{v-e-C) (86) 

s = l 

The resultant of the two steps a and b, inclined at the angle C, 
has tensor 

c = J{a^ + b- + 2ab cos C). 
But the resultant of the two steps (85), 
(86), by V, 4, has as its expression the product 
of (85), (86) symbolising the steps. 

Let 6 take all values from to 2ir with 
frequency ddftir, a, b, G being constant. 

Hence we conclude that the circle of radius c equals the integral of 
the product of (85) and (86) into dOj^Tv ; 
:. /„{ s/(«' + &' + 2a& cos C)t} 

= /o («t) /„ {br) + 2% Is (ar) Is (br) COS sG, (87) 

which is the same as Neumann's theorem. 

13. Non-circular distributions. Vector sampling when directions 
have bias. 

If, firstly, the frequencies of r and 6 are independent of one another, 
the array of points in a plane, shown generally in (75), will take 
the form 




/> 



\f{r) g {6)e"''°^^''-^) drdei^TT, (88) 

and, if g {&) is expanded in a Fourier series, 

9{&) = g<,+ ^9s<ios{se + ys), 

the array is, on integration with respect to 6, 

(f(r) {golo (rr) + S.gJs (rr) cos {sv + -y,)} dr (89) 



A 



Thus (89) is the array representation, in polar symbols, of any 
distribution of points, in two dimensional space, for which r, 6 are 
independent variates. 
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If, secondly, we renio\e the restriction that ;/„, //, , y,, are independent 
of /•, (89) may represent any distribution of points, whatever, in a 
plane. 

In either case, on integration with respect to r between the definite 
limits, we obtain an array of the form 

<^ (t) + 2 ^s (t) cos (sv + Es), (90) 

in which t is the tensor symbol, v is the versor symbol and all else are 
constants. 

If from uuiversals as (90) we take samples, in given numbers, by 
(17) we may at once array the resultant points, or the points whose 
coordinates are the sums of the coordinates of the sampled points. 
Such arrays will take the form U' V™]V" ... where C, V, W ... are the 
universals from which l,m, n ... individuals are taken at random to form 
the sample. 

Thus any problems in sampling of vector distributions in two 
dimensional space can be solved so far as to express the result as an 
array, in the form (90). 

Clearl)-, since the indices such as I, m ... are integers, if the gonial 
functions g (0) are expressible as a limited number of Fourier terms, the 
resultant arraj', in such a problem, will be expressed in a limited number 
of terms as 

^/z (t) + 2 i/'s (t) cos (sv + rjs), 

with, now, different s values. 

In order to arrive at frequencies, the integral equation to be solved 
is, by (89), 

i/f (t) + 2 i/'j (t) cos (.su + i?s) = I /i{r) \l\,I,(rT) + 'S,k,I,(rT)cos(sv + K,)]dr. 

s Jo s 

(91) 

If (91) is an identity in the symbols t, v, it follows that the values 
of s are the same for the two sides ; and that 

and that h (r) /,;,, h (r) ks must satisfy 

V'(t)= r h{r) /.■„!„ {rT)dr, 

JO 



t 



',{r)=r h{r)kMrT)dr. 
Jo 
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Hence, by the theorem already quoted, 

h if) ^0 = / rx Jo {rx) i/' {ix) dx, 
Jo 

h(r)ks= I rxJs (rx) fg {ix) dx 
Jo 

and the frequency of, resultant point, or vector, is 

k (r) {ko + S^s cos {s9 + rjg)} drdO, 



or 



I rx {Jo (rx) \p {ix) + 2 Js(rx) ^/siiso) cos {s6 + %)} dx drddj^ir. 

■ Jo s -1 

(92) 

14. Polar symbols in three dimensional distributions. If x = r cos 6, 
y = r sin 9cos(f>, z = r sin 6 sin <l>, then the symbols |, y}, t, may be replaced 
by T, V, 0) defined by 

i = T cos V, rj = T siu V cos CD, ^ = r sin V sin m 
and the point x, y, z therefore has symbols 

X^'-yZ^ or g^^ + 2"'+^i', or e'-^<=os,T 

A 

where rr is the expression for the angle between 'directions' r and t 
and 

A 

COS rr = cos 6 cos u + sin 6 cos <^ sin v cos w + sin 6 sin <^ sin u sin w 

= J [cos (v - e) {1 + cos (w - <^)} + cos (v + 6) {1 - cos (o) - ^)}]. 

(93) 

A distribution of points, whose frequency at r, 6, 4> is 

F(r, 6, 4>)drded'i>, 
will be arrayed as 

\^^F{r, 6, 4>)e'-^''°^''^drded4>, (94) 

taken between and =o , or definite limits, for r, between and tt for 6 
and between and 2ir for 4>. 

15. Splmrical distributions and vector sampling ivhsn all directions 
in space are equally probable. Hence points uniformly distributed over 
a spherical shell of radius r will have as their expression 



I'lsmBe'^^^'^^de, or "^^ (95) 



^0 rT 

Thus the expression for a spherical shell is the same in form as the 
expression, by (69), for the polar diameter, but the interpretation of the 
symbol is different in the two cases. 
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A distribution of points in space, the same in all directions, will, in 
consequeuce, be shown as 

rfi,-)'''^-^dr, (96) 

Jo IT 

where /'('■) '^■'' is the frequency of points at distance r. 

Denoting this by <^ (t), then 4> (t) may represent a universal of points, 
steps, or vectors, from which samples are drawn at random. 

If n samples are drawn, the resultant vector, compounded of the 
samplings, by (16), will have as its array f (t) = {cl> (t)]^- 

As in the two dimensional case, 4> i'^) is a moment array, and, by (96), 

2 -4 

4>{T) = l + m,^ + 7n,~ + (97) 

But 4' (t) is also a distribution of points in spherical shells, and so 
the moments of the resultant tensor are given by the expansion 

^(T) = l + J/,^ + iV,^ + (98) 

From the simple relation of ij/ (t) to 4> (t) we find the solution in 
moments of this problem 

J/, = nT/ii + f w (w - 1) Wa^ 

J/j, = nrris +7n{n— 1) nhmi + ^pM (w - 1) (« — 2) m-i", 

J/s = nms + f w (» - 1) (21m/ + ^Om.iin^ 

+ 42w(M-l)(n-2)OT2'«24 + %^-m(«-l)(?J-2)(w-3)?w/. ...(99) 
"When, by the methods of Oh. ii, the distribution of the resultants 
of samples drawn in any manner from given universals such as (/>(t) has 
been expressed as an array, i/'(t), in the tensor symbol, we pass, by (96), 
to the solution in frequencies, say h (r) dr, of tensor length, by solving 
the integral equation 

^(t) = J^ }i{r) ~^^Ar. 

The solution, by Fourier's inversion formula*, is 

^(r) = - rxws.{rx)^{ix)dx, (100) 

* Fourier's inversion formula 

i/-(t)= j(T)smna.r, j{r)=-\ f{t)smrtdt 
Jo ""jo 

is the particular case of Hankel's inversion formula 

f(t)=rj{T)rJ^{n)d,T, /(r) = j^^i^{t)tJ„{rt]dt, 
when n=J, in virtue of the relation J,(x)j ^ x= Jljir s,'mxjx. 
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in which the imaginary is, of course, apparent only, the arrays <^(t), 
"/-(t) being even functions of t. 

In particular, if n vectors of length I are randomly directed in space, 
we are, in effect, drawing n sample points from the spherical surface 
sinh ItJIt and the array of resultants will, therefore, be 
iA(t) = (sinh hlhf. 

The frequency of tensor length r will, therefore, by (100), be 
h (?•) dr, where 

h(r) = - I rx sin rx (sin IxllasY clr, 
"■Jo 

in agreement with Rayleigh's formula*. 

16. Fortuities of resultant tensor wlien n randomly selected tensois 
are set in random directions in a space of s dimensions. The generalisa- 
tion of the foregoing formulae, given for onet, two, and three dimensions, 
to the case of s dimensions, directions being random, is not difficult and 
may be indicated briefly as follows. 

Clearly the denominator, or symbol, for the point is as before 

X^ Y'JZ~' . . . , or e^^ + 2''' + ^f + • ■ •, or e'"'" <=°^ "", 
where r is the radius vector, t the tensor symbol, defined much as in 

A 

the previous three-dimensional case, and rr, as there, is the algebraical 
expression for the angle between the actual direction r and the con- 
ceptual direction t in the s-dimensional space. 

Calling this angle 6, then in order to form the frequency array of a 
uniform spherical 'surface' of radius r in this space, the frequency, or 
numerator, of the point denominated g"'=°^* is the 'area' of the zone 
d6 divided by the 'area' of the sphere. Now if o-g(j-') designate the 

dP 

* Loc. cit. p. 341, formula (59), in which -^-^ is, in his notation, the frequency 

coefiicient h{r). 

t The parallel theorem in one dimension leads to a solution of the 'problem of 

moments. ' cosh x^ or J ( A'^ + Z~^) is a doublet of points and, if <p{^) is an even 

moment array, the solution of 

/"" 2 /"" 

il/(f)=l h(x)cosiix^dx is h{x)= - I cos ux .\l/{iu)du. 
jo ■^ J 

sinh .rf or J (A'^ - A'-') is a doublet of zero frequency and, if x (I) is an odd moment 
array, the solution of x(|)= I A: (.i;) sinh .r| dr is li{x) = - ( simix.ix(i^)'^>'- If, 

./ IT _/ 

then, 0(?) = \J'(f)4-x(f) is any moment array as (23), divided into even and odd 
arrays, the solution in frequencies is 

f{x) = h{x) + k{x) = ~ I {cosaa;. \j/ {iu) + sin ux . ixiiu)} du (105) 
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'surface' of a sphere of radius /■ in ^-dimensional space, the 'area' of 
zone d6 is rd6 . 0-^-1 (r sin 6) and the frequem-y of the point is therefore 
rdO . o-,_i (r sin e)/o; (r). 

Clearly this is sin'''- 6 dd / BI~, —^j, since the power of sin 6 

follows from dimensions and the divisor follows from the consideration 
that the integral from to -t is unit}'. 

Thus a spherical 'surface' of radius r is 



I 



, or r ^° " " -' doi") 

5(l,izl) (irT)i^-l ■■ ^''") 



This expression reduces to Z.irr) when s = 2 and to sinhrr/'/T when 
s = 3 (see foregoing note) thus giving the arrays of circle and spherical 
surface abeady found in the two and three dimensional cases. 

Proceeding as hefore, a distribution of points in uniform generalised 
spherical shells is shown as 



/■" !tS-l]li,_il.rr) 



/{r) dr being the frequency of points in shell dr at distance r. 

This being a frequency array, any problem requiring the distribution 
of resultants of samples obtained in any manner from such universals is 
at once, by the methods of Ch. n, expressed as a kindred frequency 
array. For instance, if <t'(j) stands for the above universal and samples 
of n are taken at random as to length and direction, the resultants have 
array 

Secijndly, such arrays as 4> (j), ip C'^) are moment arrays and when 
expanded in powers of the symbol, t, give the moments of the tensor 
length as certain coefficients. In the present case, by (K'l'j, 

^(,) = 1 + ^^_I_ + ^^__^1__+ (103) 

And, thirdly, from the solution of the problem as a frequency array, 
xj/ (t), we may obtain the frequencies, h (r) dr, of the tensor length by 
solving the integral equation 



•A 



{T)=rh{r) 
Jo 






and the solution is, by a further application of Hankel's formula, 



I (r) = r ' -,^' lA (ix) dx (104) 



48 GEOMETEICAL DISTRIBUTIONS. SAMPLES OF VECTOES 

In the foregoing pages illustrations are given of the assistance that 
objective symbols can render in the specification of numerical aggre- 
gates, whether seen in a single or manifold classification. Such symbols 
are of the simplest character, answering as they do to the common noun 
in speech, and it is not surprising to find that their use brings to the 
work of analysis a brevity that the algebraical symbols, with their more 
involved connotation, fail to give. That the equation f{x, y) = aptly 
describes a relation between the number x of the literal X and the 
number y of the literal Y is beyond doubt, but if the purpose be merely 
to state that there are 15 of J_^and 20 of .Y then either the expression 
ltX+ 20 Y or the symbol X^ Y"^ appear to fulfil such requirements in 
a sufficient manner and more simply than the equations a;=\b, 3/= 20. 

The aim, then, has been to recommend the use of logical symbols in 
the enumeration of logical classes. It has been no material part of the 
purpose to establish new formulae and results in the mathematical theory 
of statistics and if new conclusions have been reached these will serve 
chiefly to help point the precept, since no difficult analysis has been 
undertaken or is anywhere involved. 

It is possible that, with wider familiarity in their use, the applications 
of the symbols of denomination may bear extension and that they may 
be found of assistance in the development of the higher theory both of 
statistical and other distributions. 
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